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There has been a maltiplioatlon of tests In such sohool 
suhjeots as handwriting, algebra, arithmetio, spelling, and 
readings The same is true, thou^ to a less extent, of United 
States history^ The purpose of this thesis is: (l) to list 
the tests in this field and give a short oharaoterization of 
each; (s) to present the problem of a comparative study of 
these existing tests; (3) to describe the methods of this 
study and the results obtained; and (4) to discuss the conclu- 
sions from the abore results* 

In giving the list of tests, only those which cover the 
entire period of United States history are given, for they are 
the ones used in making this particular study and are, with 
one exception, the only tests available for use* 

At present there are six men who have developed at least 
one test in United States history. They are listed below with 
a statement of the character of the test, and where reported 
or where it may be obtained. 

1# Bell and UoCxG.lom-» Informational, asks for date-events, men- 
(arranged by 

L.W.Sackett) events, historical terms, political par- 
ties, divisions of history, map study* 
Heported in Journal of Educational 
Psychology, Till ( May, 1917 ) , 257 
Copies may be obtained from L* W# Sackett, 
University of Texas 

Z. Barr, A# S# - Informational, thought, reasoning. 
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judgment* Cons lets of (1) quotations 

and answers, (S) tests for hlstorioal 
Inference » seleotlon of facts* 

Z. Buckingham, B«B«- Thought and Information* A series of 

thought questions demanding reasoning; 
and a series of Infozmatlonal questions* 
Beported In School and Society > V, 
(April 14, 1917), 443*48* 

4* Starch, 7> -* Informational, completion type* Senten- 
ces concerning personages, dates, erents, 
and historical terms In which blanks hare 
been left to be filled In \>j students* 
Copies may be obtained from the Co-Opera- 
tire Book Company, Madison, Wisconsin* 

5* Yan Wageman, M*J*« Infomatlonal, thoiight, character, judg- 
ment* Seren tests on these three sub- 
jects asking questions and giving quota- 
tions and asking questions about them* 
Tests and reports of the tests may be 
obtained from the Bureau of Publications, 
Teachers* College, Columbia Unlyerslty* 

6* Harlan, C*L* Informational, completion* Ten exercises 

In one test asking for personages, terms, 
erents, hypothetical conclusions, causes 
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and results* Copies may be obtained 
from the UnlTerslty Book Store, Minnea- 
polis^ Minnesota. 
As. has been seen, the majority of these tests In history 
have made use primarily of faots or Infozmatlon of a hlstorl- 
oal nature; the assumption doubtless being that the readiness 
with whloh students give the faots of history Is a measure of 
what Is oalled hlstorloal ability. These tests are based 
largely upon the pjsyohologioal prlnolple of assoolatlon. ?or 

example, Staroh gives as his first statement, " 

dlsoovered Amerloa In 1492.^ The problem of the student Is 
to assoolate the proper word with the rest of the sentenoe 
and fill In the blank. Harlan uses the oompletlon method dif- 
ferently. 7or exasgple. In Szerelse 11, he states, ** Below Is 
a list of terms used In United States history. Grlve a particu- 
lar example of each, suoh as: Battle-Bunker Hill.** Then 

gives a list, 1. Colony , £. Pioneer ^ «ad 

eight others. He also brings In the element of hypothetical 
ooncluslons when he states: **Tell the first thing you would do 
under each of the following conditions, also what you would do 
next: 1. If a neighbor were to present to you for your signa- 
ture a petition to have some man removed from public office,- 
What would you do first? Would you sign the petition? " 
In the Bell and McCollum test the pupil Is required to give 
the Important event occurlng In 1661, 1789, 16S0, etc., to In- 
dicate for #hat certain men were celebrated, to list the poli- 
tical parties and epochs of history, and to trace on an out«» 
line map the territorial acquisitions since the Bevolutlonary 
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The tests of Barr, Buoklngham^ and Tan Wagenen are more 
oomplioated In that the pupil Bach reach to such processes 
as thought, reasoning 9 and judgment* ?or example » Barr in 
his Test 1V» Series A gives this problem, "Fat a cross ( ) be* 
fore the event In the following list that has been of the great- 
est importance in the economic development of the United States, 
(a) The B.R« strikes of 1877, (b) The Tariff Act of 18£8, 
(c) The Sherman Act of 1890, (d) The Panic of 1837, (e) The 
origin and growth of Labor Unions.** 

The Buckingham tests are four in number, two being thought 
tests and two information tests « In the thought test. Series 
A, he asks, for example, **Under what topic would you place 
the following event: Missouri Compromiset Bred Scott Becision, 
Uncle Tom's Cabin, and the Thirteenth Amendment to the Consti- 
tution?** In the Information Series he asks such questions 
as, **?rom what nation did the settlers along the Hudson come?** 

Van Wagenen in his Information Scales asks such questions 
as, **What people did Columbus find in America? Who was Presi- 
dent of the United States during the Civil War?** In his 
Thought Scales a quotation or statement is given and a question 
asked concerning it, for example, **In 1829-30, it took over 
160 hours of work to raise 50 bushels of wheat; in 1895-96, it 
took less than seven and a half hours to raise the same amount. 
How can you account for the difference?** 

Thus it is seen that there are two distinct ends or abili- 
ties to be tested by the various tests: (1) Information con- 
cerning the facts of history, and, (S) the higher mental 
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The problem in relation to these tests is to try and 
determine the relative merits of the different soales, and, 
80 far as possible, to find out their validity, reliability, 
and usefulness. Very little has been accomplished in this 
field, although the individual tests have been given to a 
sufficient number of students to determine their possibilities, 
if not their practicability. 

In an attempt to solve this problem, the Starch, Bucking- 
ham Information Series 1 and Thought Series A, Harlan, and 
Bell and tfcCullom tests were given in the spring of 19S0 to 
sixty students of the Senior Class of the Stevens Point, Wis- 
consin High School. These students were all in classes in 
United Stated history conducted by the author, and the tests 
were given after the entire work of the course had been com- 
pleted. Muszey's An American History was the text used as a 
basis for the work of the students. The tests were given at 
intervals of two or three days, and forty-five minutes allowed 
for completive the work. The next spring (1921) the work was 
repeated with sixty-four students of the Senior Class of that 
year in the same school under the same conditions, except that 
the Van Wagenen Scales, Thought Series A, and Information 
Series A, and the Barr Series A, Test IT were added to those 
given in 1920. The attainments of the individual students in 
the various tests were then worked out and compared with the 
grades assigned in history and with the av'erage of the grades 
in all other subjects. The results obtained are given below 
in Table 1 and Table 11. 
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TIBLB 1 

19E0 : History :▲▼. Other: Staroh :Buok. :Bnolc. :Ear- :Bell & 
Pgpil ; Sradte : Sradea : ;Thot.:Inf« ;lan dioCttllom 
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TIBLB 1 
( Continued) 



1920 : El8tor7:AT. Other. Starch. Baok. :Baolc. :Ear-«Bell & 
Pgpll : Grades »Qradee : I Thot : Tti-p. :lan tMoOollom 

49 : 91 : 98 : 7g ; 90 : 84 ; 86 : 1710 



^ At. I 84.6 : 8g.5 ; 50.g : 79.9 t 64. :77.g: 1489.4 
8.D. ; 56.1 t 4.6 : 80.6 : 14.7 : 19.7;12.4: g78.e 
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TABLB 11 
( Continued) 
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In order to understand these tables an explanation of the 

narlcB assigned is neoessary* The **passing" grade of the school 
is 75, and all grades are assigned upon this basis. The 
grade assigned in the Staroh test represents the aotual nxmber 
of blanks filled in oorreotly out of a possible 134. The 
grades in the Harlan test represent the percentages upon the 
basis of a perfect score of 100 , each question being graded 
on the scale of 10. The Buckingham Thought Test was graded 
upon the scale of 100, each question being graded upon the 
basis of £0; and in the Buckingham Information Test each of 
the twelve questions was marked on the scale of B. The Van 
Wagenen Scales were graded according to the values assigned 
by the author, in the Information Scale a perfect score being 
102, and in the Thought Scale 87« In the Barr test a perfect 
score is 20, each question being assigned the value of 2, ex- 
cept two questions which were given the value of 5. The Bell 
and McCollum Scale gives in detail the value for each question, 
the perfect score being 2642. 
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7 
In order to determine the relationship between the grades 
assigned in history and the yarious tests; the relationship be- 
tween the marks given in the other subjeots and the attainments 
in the history tests; and the relationship between the tests 
themselves, the Pearson ooeffioients were worked out, and are 
given below in Table 111 and IT. 
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In oonduotlng an experiment of this kind there are several 

results one would expert to find: (1) taking for granted that the 
grades of the pupils were assigned fairly, one would expeot to 
find the arerages of the two groups in fairly elose agrvement; 
(2) if the tests are accurate measures of attainment and ability, 
one would be led to believe he would find a fairly close relation- 
ship between the two groups in the various tests; (S) if the 
marks assigned in history represent fairly accurately the histori- 
cal ability of the students, and the tests form a means for 
measuring that ability, then one would expeot to find a rather 
high correlation between the grades assigned in history and the 
marks given on the tests; (4) if the above were true of the his- 
tory grades, and the grades assigned in other subjects represent 
a fair mark of the attainments and abilities of the students in. 
the studies, then one would expect to find a fairly high correla- 
tion between the history grades and the grades assigned in other 
subjects, since it is generally accepted that, on the whole, 
abilities are general and are not confined to particular subjects; 

(5) if the above is true of the grades in other subjects, one 
would expect to find a fairly high correlation between the grades 
in .the other subjects and the attainments in the history tests; 

(6) if the history tests are of fairly equal value in measuring 
the historical ability of the students, one would expect to find 
a fairly high correlation between the results obtained in the 
various tests. 

The obverse of the above suppositions or expectations would 
also be true; that is, those grades and those tests which were 
the least fair measures of the students' attainments and abilities 
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9 
would show 1»Tit little relationship and oorrelatlon. That state** 

ment may not 'he true entirely, 1»ut with the number of tests used 

and the number of pupils tested the results obtained would give 

at least an approximate knowledge of the relative values of the 

different tests slnoe all of them are tests for praetieally the 

same thing* 

In referring to the results obtained, those sixty pupils 
tested in the year 19E0 will be spoken of as Group 1, and those 
8ixty<>four tested in 19£1 will be referred to as Group 11* 

Upon examination of the tables to determine how the aetual 
results measure up to our expeoted results we find: (1) the aver- 
age of Group 1 in history to be 64.56, with a standard deviation 
of 5.1, and the average of Group 11 to be 8S«46 with a standard 
deviation of 5.9, showing a differ enoe of 2.1 in the average of 
the two groups and a dlfferenoe of .8 in the standard deviations. 
The average in the other grades in Group 1 is 83 .5 with a standard 
deviation of 4#6, and the average of Group 11 is 82.34 with a 
standard deviation of 5.08 showing a dlfferenoe of but 1.16 between 
the averages and .48 in the standard deviations. These figures, 
then, indicate a fairly close agreement in the distribution of 
the grades between the two groups. 

(2) In comparing the averages attained in the different tests 
by the two groups, the Van Wagenen and Barr tests must be left out 
of consideration since they were given only to Group 11. The 
average in the Starch test for Group 1 was 60.3 with a standard 
deviation of 20.6, and for Group 11 the average was 36.2 with a 
standard deviation of 16.6, showing a difference of 14.1 in the 
averages and 6. in the standard deviations. Group 1 averaged 
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10 
78*68 in the Buoklnghem Thot test with a standard deriatlon of 

14«7, and Group 11 averaged 78.66 with a standard deviation of 
10.9 showing a differenoe of .23 between the averages and S.8 be- 
tween the standard deviations. The averages for the Buokinghsm 
Information test were 64. and 7S.73 respeotively with standard 
deviations of 19.7 and 11.9 showing differenoes of 9.73 and 7.6. 
The averagds for the Harlan test were 77. £6 and 74.15 with stand- 
ard deviations of 1£.4 and 9.6, showing differenoes of S.l and 
2.9. The Bell and MoCullom tests showed averages of 1489.4 and 
140£.S with standard deviations of 378.8 and 834.8 whioh made the 
differenoes 87.1 and 44. Prom these statements it will 1»e noted 
that on the whole Group 1 did slightly better than Group 11 1»oth 
as regards the marks assigned and the attainments in the tests; 
and that a greater variation exists among the attainments in the 
tests than is represented 'hj the grades. This differenoe is 
pro1»a1»l7 due to the faot that the grading system oonfines the 
marks to the narrow limits Itetween 70 or 75 and a1»out 95, whioh 
marks represent more narrowly the ranges of individual a'bilitieS 
expressed in wider ranges 1»y the attainments in the tests. 

In referring to the oorrelations found, the standard of Bugg 
in his Btatistioal Methods Applied to Eduoation is used. On page 
£56 he states that he regards oorrelations as "negligible** or 
** Indifferent" when less than .15 to .20; as 1»eing "present 1»ut low" 
it ranges from .15 or .20 to .35 or .40; as 'being "marked" when 
it ranges from .35 or .40 to .50 or .60; as being high when it is 
above .60 or .70. 

(3) The oorrelations between the grades assigned in history 
and the various tests in Group 1 range from .30 in the Buoking- 
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ham Tliot test to .72 In the Harlan test with an average oorrela- 
tion ot .66it^ This shows that some correlation exists 'between 
the grades in history and all the tests. In Group 11 a wider 
range of correlation exists » the lowest being with the Barr test 
which was .12* and the highest with the Bell and MoCull&m which 
was .65 with an arerage correlation of «48« In comparing the 
two groups the correlations are somewhat similar, being practically 
identical in the Bell and McCullom test and varying from .04 to 
•13 in the others. In the two groups the Harlan and Bell and Mc 
Cullom tests showed the highest correlations which may be consid*- 
ered ^'high^, and the lowest correlations were with the Buckingham 
Thot and Barr tests, which may be considered as "^low". However, 
it must be remembered that these two teste which showed low oorrela* 
tions were thought or judgment tests, and that perhaps the teacher 
assigned his grades according to ability of the pupils to give 
back facts alone, and that if the grades were assigned according 
to the ability tested in these tests then the situation would be 
reversed. I do not believe that to be the case and the reason for 
that belief will be stated under the discussion of the correlations 
among the tests themselves. 

(4) The correlations between the marks assigned in history 
and the marks in other subjects in Group 1 is .56 and in Group 
11 is .72 , or an average of .64. These figures represent 
rather high correlations and show that the measures of ability 
indicated by the history teacher corresponded rexy closely with 
the measures of ability given by the teachers in the other subjects. 

(6) The correlations between the averages of the other grades 
and the various tests are similar, though somewhat lower, to the 
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oorrelatlons between the history grades and the tests. Howerer 

Bome differences are apparent. The highest oorrelatlon In Group 

1 Is with the Harlan test as It was In the oase of the history 

grades, Iftut the lowest Is with the Staroh test Instead of the 

Buckingham Thot« In (xronp 11, the highest oorrelatlon Is with 

the B«ll and MoCullom test and the lowest with the Bnoklngham 

Thot Instead of the Barr test. The average oorrelatlon In Group 

1 Is .40, and In Group 11 43 S/S showing a "marked** oorrelatlon 

between the arerages In the grades In the other studies and the 

history tests. 

(6) Upon examination of the correlations Itetween each test 

with the other tests, one Is Impressed Ity the wide variation, 

which rangeA all the way from a negative .08 Itetween the Starch 

and Barr test In Group 11 to .82 between the Harlan and Bell and 

McCullom tests In Group 1. One would expect to find the correla<> 

tlon greatest between those tests most nearly alike, and least be* 

tween those tests which are different In type and character. In 

general, this Is true. The greatest correlations exist between 

those Informational tests which are very much alike, such as the 

Harlan and Bell and McCullom, and the lowest correlations are be<> 

tween those tests which are dissimilar, such as the Barr and Starch 

tests. However, the correlations between the thought tests with 

each other are not so high as In the oase of the Information tests. 

The highest average correlation In Group 1 Is the Harlan test with 

.59, the lowest being the Buckingham Thot test with 31 2/3. In 

Group 11 the highest average correlation Is the Bell and McCullom 

with 48 7/9, and the liwest the Barr test with .09 l/3. Had the 

whole series of the Barr tests been given, no doubt a higher 
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correlation would have been found. 

The resulte of this study indioate that those tests which 
measure the most objeotive thing in History^ namely the aliility 
to assooiate properly the essential or outstanding faots are the 
most satisfactory^ and on the whole give the best results; and 
it seems to the writer that this is the fundamental thing to be 
measured* The ability to use those facts critically and correct- 
ly is the outstanding end to be attained. The difficulty of 
scaling and the difference of opinion make such questions as the 
following, taken from the Barr test, of minor ralue in estimating 
the pupil's historical ability : 

''Put a cross ( ) before the three names of men in the follow- 
ing list who have been of the greatest importance in the deyelop* 
ment of the American Bation: 

(a) Dred Scott 

(b) Commodore Perry 

(c) John Jay 

(d) Alexander Hamilton 

(e) Captain John Smith 

(f) Stonewall Jackson 

(g) Thomas Jefferson 

(h) William Henry Harrison 

(i) Andrvw Jackson ** 
Answering the above question is like txyinp: to arrive at a 
satisfactory conclusion whether Washington or Lincoln was the 
greatest American* The above calls for knowledge, to be sure, 
also historical judgment, and some thought and reasoning. But are 
the types of reasoning and historical Judgment the kind that a 
study of history aims to develop? Isn't the aim of history to 
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preaent the great oliaracters and facts of our oountry to boys 
end girls in such a way that they will use those facts and ideas 
and ideals to make better citizens out of themselves^ to have a 
greater love for country, to have a wider outlook upon life, with 
a broader and more intelligent point of view concerning present 
conditions and situations in the country as a whole? These ele- 
ments of citizenship, patriotism, and widened viewpoint are con- 
sidered to be outstanding aims of history, and yet, can they be 
measured by putting marks upon a paper? Hardly. But in*8o-far 
as history contributes to a realization of those aims along with 
the development of memory, Judgment, reasoning, and association, 
the fundamental requirement is a usable knowledge of the essential 
facts of United States history, and the test which measures that 
fundamental knowledge in the most efficient and economical way 
best serves the purpose as a means for measuring historical ability < 

In conclusion, a word concerning the general values and 
general criticisms of the tests is perhaps not out of place. The 
outstanding values seem to the writer to be: (1) A check upon 
instruction and a means by which the teacher of history may 
measure his estimate of the pupils' performance; (2) improvement 
of instruction by showing wherein pupils are weak and wherein lack 
of emphasis has been laid; (3) as a means of review, showing not 
only the teacher but the pupils wherein additional work must be 
done; (4) as a means of revealing olass and individual differences 
which stimulates and aids in study. 

The criticisms are of a constructive nature and represent 
some outstanding features, namely: (1) constant revision to in* 
elude more exercises relating to events of receht date; (2) Divl* 
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slon of some of the tests into periods » thereby shortening 

the tests theioselTes and making possible the using of them 

from time to time during the sohool year; (3) elimination of 

some of the subjeot matter whioh is not of vital importanoe; 

(4) making some of the tests more available by reducing their 

oost* !l!he above are given by the writer as suggestions whioh 

will increase the value of the tests and which will bring about 

a decided increase in the efficiency of class-room instruction 

that is the end and object sought for* 



Digitized by 



Google 



Digitized by 



Google 



Professor of Educi 
August I 1921 



Approved 

Professor of Education 



Digitized by 



Google 



Digitized by 



Google 






Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by VjOOQIC 






Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



» \ 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by' 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Goo<?Ie 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



y" 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



X " 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitizegl by 



Google 



Qigitized by 



Google 



/^ 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



Digitized by 



Google 



V 



Digitized by 



Google 



Digitized by 



Google 



\ 



Digitized by 



Google 



nniiiiii 

B89085989648A 



Digitized by 



Googl3 




Digitized by 



Google 



Digitized by 



Google 



